according to such an idea, we propose a new retrieval method that combines xpath and vector space model, named as the vector retrieval model based on xpath . secondly, we make full use of the hierarchical architecture of xml data, and analyze the structure of every document to construct a structure thesaurus, which is designed to navigate the user query and to eliminate the structural conflict 根據(jù)這一思想,作者提出了將xpath語言與傳統(tǒng)的向量空間模型相結(jié)合,實(shí)現(xiàn)基于簡單xpath路徑的向量檢索算法來實(shí)現(xiàn)對xml文檔的檢索。充分利用xml文檔分類層次體系結(jié)構(gòu)的特點(diǎn),對于每篇xml文檔分析其文檔結(jié)構(gòu),并采用聚類學(xué)習(xí)算法形成文檔結(jié)構(gòu)類屬詞典,從而實(shí)現(xiàn)xml文檔查詢的導(dǎo)航機(jī)制和消除文檔結(jié)構(gòu)的異構(gòu)性。